Improving large scale alphanumeric string recognition using redundant information
نویسندگان
چکیده
This paper describes a framework for improving recognition performance and user experience in large scale alphanumeric listings commonly used in conversational speech applications for enterprise. The performance of these speech recognition grammars is severely impacted due to the poor recognition of alphabets. We propose a new approach based on augmenting performance through redundant semantic information. This provides additional acoustic features, which, although is redundant in the semantic space, improves performance by 30% in Canadian postal code application and serial number recognition. The additional queries for redundant semantic information are asked only when necessary: when the system makes false acceptance errors. This ensures that user satisfaction is not interrupted through needless questioning. Furthermore, we propose a way to compress the listing grammar by at least 85% in footprint with minimum performance impact due to good performance in digit recognition. This framework can be extended for general large scale alphanumeric listing grammars.
منابع مشابه
License Plate State Recognition based on Logo Matching and State Name String Classification
In most countries, vehicle license plates contain both alphanumeric characters and the state/province of origin. State/province recognition of the license plate provides additional information to the traffic management agency and guidance information, aiding in character segmentation and in recognition. Existing methods use the character string of the state name only. Unfortunately, in some cou...
متن کاملRegion Representation Using Enhanced Discrete Cylindrical Algebraic Decomposition (edcad) to Preserve the Shape and Size of the Connected Regions in Binary Images
In this paper we proposed a syntactic approach to represent any connected region taken from the binary digital image. The proposed method is an enhancement of DCAD algorithm and it provides alphanumeric string to represent the connected region, which preserves the shape and size. In enhanced DCAD we introduced two variables to differentiate the left and right bends to avoid the shape anomaly an...
متن کاملStrCombo: combination of string recognizers
In this paper, we contribute a new paradigm of combining string recognizers and propose generic frameworks for hierarchical and parallel combination of multiple string recognizers. The frameworks are open to any new achievement in either recognizers or combination algorithms, and can be applied to both machine-printed and handwritten string recognition problems. A parallel combination system, S...
متن کاملArabic Handwritten Alphanumeric Character Recognition Using Very Deep Neural Network
The traditional algorithms for recognizing handwritten alphanumeric characters are dependent on hand-designed features. In recent days, deep learning techniques have brought about new breakthrough technology for pattern recognition applications, especially for handwritten recognition. However, deeper networks are needed to deliver state-of-the-art results in this area. In this paper, inspired b...
متن کاملAn Intelligent System for Conflict Resolution in Handwritten Address Recognition
Background: The basic recognition engine of a handwritten address interpretation system, for use in postal sorting automation, is an OCR algorithm that recognises a numeric or alphanumeric string, such as a postcode, and matches it against a set of valid postal delivery points. However, the OCR system is highly vulnerable to errors due to the uncertainty that arises when the imperfect OCR resul...
متن کامل